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Abstract 

Let E = {(eij)) n xn be a fixed array of real numbers such that ey = eji, en = for 1 < i, j < n. Let 
the symmetric group be denoted by S n and the collection of involutions with no fixed points by LT n , that 
is, II n — {it £ S n : TV 2 — id,ir(i) 7^ iVi}. For tv uniformly chosen from IT n , let Ye = Y27=i e ^(i) 
W = (Ye — lle)/<Je where he = E(Ye) and cr% = Vslt(Ye)- Denoting by Fw and $ the distribution 
functions of W and a J\f(0, 1) variate respectively, we bound \\Fw — $\\p for 1 < p < 00 using Stein's 
method and zero bias transformations. The resulting bound obtained is the product of a third moment 
type quantity multiplied by an explicit constant, and in particular for p — 00 is of the same form as 
the one obtained by Bolthausen for Hoeffding's combinatorial central limit theorem when n is chosen 
uniformly from S„- The approximation developed here for involutions has applications in testing whether 
there is a significant degree of similarity in certain matched pairs experiments. 



1 Introduction 

Let E = {(e-ij)) be an n x n array of real numbers. The study of combinatorial central limit theorems, that 
is, central limit theorems for random variables of the form 

n 

usually focuses on the case where it is a permutation chosen uniformly from the set of all permutations of 
n elements. Approximating the distribution of Ye by the normal when tt is chosen uniformly began with 
the work of Wald and Wolfowitz [14] . who, motivated by approximating null distributions for permutation 
test statistics, proved the central limit theorem as n — > 00 for the case where the factorization = biCj 
holds. In this special case, when bi are numerical characteristics of some population, and k of the c/s are 1, 
and the remaining n — k are zero, Ye has the distribution of a simple random sample from the population. 
General arrays were handled in the work of Hocffding 8_ , and in Motoo obtained Lindeberg-type conditions 
which are sufficient for the normal limit in |10) . 

A number of later authors refined the limiting results and obtained information on the rate of convergence 
and bounds on the error in the normal approximation, typically in the supremum, or L°°, norm. Ho 
and Chen [7] and von Bahr [13] derived L°° bounds when the matrix E is random, the former using a 
concentration inequality approach and Stein's method, but which yield the correct rate 0(n -1 / 2 ) only when 
certain boundedness conditions on sup i • \e^j \ are satisfied. Bolthausen pQ obtained a bound of the right order 
in terms of third-moment type quantities on E, but with an unspecified constant. Goldstein 4J employing 
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the zero bias version of Stein's method obtained bounds of the correct order with an explicit constant, but in 
terms of supj j \eij\- Overall then, the work of Bolthausen 1 |, proceeding inductively, is the only one which 
yields a bound in terms of third moment quantities on E without the need for conditions on sup,j 
Though an explicit constant was not obtained in [JJ, as we will see here for a related case, this inductive 
method can yield explicit constants when the calculations are tracked carefully. 

The case where it is chosen uniformly from the symmetric group is not the only case of practical interest. 
In [llj a permutation test is considered for a certain matched pair experiment, which is designed to answer 
the question of whether there is an unusually high degree of similarity in a distinguished pairing, when there 
is some unknown background, or baseline, similarity between all pairs. In such a case one considers Ye as 
in ((TJ) when n is chosen uniformly from II ra , the class of involutions without fixed points, 

n n = {neS n :TT 2 =id,\Ji:Tr(i)^i}, (2) 

where S n is the symmetric group of some n elements and id the identity permutation. Since the distribution 
of Ye is complicated, L°° bounds for the error in normal approximation enables one to test the significance 
of a matched pair. The interested reader can look into [5J for a discussion on similar applications. 

A bound to the normal for this case was provided in the L°° norm by [6 , with explicit constants, but 
under a boundedness assumption. In this paper, we follow the method of [JJ to refine the results of [5] and 
provide a bound to the normal for the involution case, with an explicit constant, in terms of third moment 
type quantities on the matrix E, and for all L p for 1 < p < oo. In particular, with K p given explicitly by 
(fTTJ . we show that the LP norm of the difference between W, the variable Ye standardized, and the normal 
satisfies 

\\F W -n P <K P — 

n 

for all n > 9, where /3e is given in (|lip . Although the constant K p is quite large in magnitude, it is the first of 
its kind in the literature. Also we improve upon Goldstein and Rinott's result by using the idea of zero bias 
transformations introduced in [5] and prove CLT of order 0(n -1 / 2 ) under much milder conditions. It should 
be noted that the method applied here can be adopted to give LP estimates in Hoeffding's combinatorial 
CLT as well, and will yield a bound of the same from as the one obtained in [TJ. 

Other work where 7r is not uniform includes that of [9], where the permutations are uniform over those 
having one long cycle, and [4] , where L°° bounds are derived under a boundedness condition for the case of 
a permutation distribution constant on conjugacy classes having no fixed points, which generalizes both the 
involution and long cycle cases. 

The paper is organized as follows. In Section 2, we introduce some notation and state our main result. In 
Section 3 the basic idea of the zero bias transformation is reviewed, and an outline is provided that illustrates 
how to obtain zero bias couplings in some cases of interest. In Section 4, L 1 bounds are obtained. Lastly, in 
Section 5 we use the calculations in Section 4 along with the recursive argument of [TJ to obtain L°° bounds. 
From the L 1 and L°° bounds, the following simple inequality allows for the computation of LP bounds for 
all intermediate p € (1, oo), 

ll/ll^ll/IIS^II/lli. (3) 

2 Notation and statement of main result 

For n an even positive integer, let 7r be a permutation chosen uniformly from II„ in ([2]), the set of involutions 
with no fixed points. Since for ir € H n the terms e^^) and e w u\i always appear together in (JTJ), and en never 
appears, we may assume without loss of generality that 

eij = eji and that en — for all i, j = 1, 2, . . . , n. (4) 

For an array E = ((eij))i<i J <, i satisfying the conditions in ([4]), define 

n n n 

e i+ = e ij> e +j = X! and e ++ = e ^ ■ 

3=1 i=l <>j'=l 
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Then, as shown in [BJ, 



Again following [BJ, letting 



(5) 



e +4 



n-2 n-2 1 (n-l)(n-2) / J /g\ 
« = j, 

we have 

e +t = e j+ = e ++ = for all i, j = 1, . . . , n. (7) 

and that 

Yg = Y E - I-ie 

where E is obtained from E by ([BJ. 

We consider bounds to the normal 7V(0, 1) for the standardized variable 

w= Ye^e (g) 
Since F# and Yg differ by a constant, ([5]) and (O yield 



2 2 2(n-2) 



-8) 4- ( 9 ) 

l<ij<" 

In particular, the mean zero, variance 1 random variable W in (|8]) can be written as 

W = ^ d iTr(i) where = ey/crg, (10) 

and moreover, the array -E inherits properties (j4]) and ([7]) from £7, as then does D from E. For any array 
-B = ((eij)) nXn , let 

ig. .13 

/3_e = >J with eij as in (JHJ) and crfj as in (0. (11) 

In order to guarantee that IF is a well defined random variable, that is, that a\ > 0, we henceforth impose 
the following condition without further mention. 

Condition 2.1. For some i ^ j, the value ^ 0, or, equivalently, 

ei+ e +j e ++ 



n-2 n-2 (n-l)(ra-2) 



7^ /or some i =/= j ■ 



Since a\j and are linearly related, Condition [2TT] is equivalent to the condition that dy 7^ for some i =/= j. 

We provide bounds on the accuracy of the normal approximation of W using the zero bias transformation, 
introduced in [5]. For any random variable W with mean zero and variance cr 2 , there exists a unique 
distribution for a random variable W* with the property that for any differentiable function / 

E[Wf(W)} = a 2 Ef\W*). (12) 
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The distribution of the random variate W* is called the zero bias transform of the distribution of W. From 
Stein's original lemma [12] , W is Af(0, a 2 ) if and only if W* is Af(0, <r 2 ), that is, the normal distribution is 
the unique fixed point of the zero bias transformation. This gives rise to the intuition that if W and W* 
are close, in some appropriate sense, then the distribution of W should be close to the normal. That this 
intuition is indeed true can be seen, for instance, in the following result from [3J: if W and W* are on a joint 
space, with W* having the W zero bias distribution, then 

\\F W - $||i < 2E\W* - W\. (13) 

In (fl~3)) , Fw and $ denote the distribution functions of W and that of a standard normal variate respectively, 
and || • ||p denotes the L p norm. We call a construction of W and W* on a joint space a zero bias coupling 
ofWtoW*. 

The following is our main result which we prove using zero bias coupling. 

Theorem 2.1. Let E = ((eij)) nxn be an array satisfying — eji,ea — 0Vi,j, and let it be an involution 
chosen uniformly from U n . If 

n 

Ye = eijr(»)> 

i=l 

and W = (Ye — [Ie)Io~e, then for n > 9 and p G [l,oo], with (3e cls in ill]), we have 

\\F W -m P <K p ^. 

n 

Here Fw denotes the distribution function of W , is the distribution function of a standard normal variate 
and 

K p = (379) 1/p (61, 702, 446) 1 ~ 1/p . (14) 

As W in the theorem is given by (|10p with 

dij = dji,da = 0, di+ = and a D = 1; Pe = Pd (15) 

we assume in what follows that all subsequent occurrences of ((dij)) satisfy these conditions and instead of 
working with E work with the centered and scaled array D only. 

In the next section, we review construction of zero bias couplings in certain cases of interest including 
the present problem. 



3 Zero bias transformation 

We prove Theorem 12.11 by constructing a zero bias coupling using a Stein pair, that is, a pair of random 
variables (W, W) which are exchangeable and satisfy 

E(W — W'\W) = XW for some A € (0,1); (16) 

see [2] for more on Stein pairs. 

As shown in [5], for any mean zero, variance a 2 random variable W, there exists a distribution for a 
random variable W* satisfying (|12j) . Nevertheless, constructing useful couplings of W and W* for particular 
examples may be difficult. In some cases, however, as in ours, the following proposition from [3] may be 
applied. 

Proposition 3.1. Let W, W be an exchangeable pair with Var(W) = a 2 6 (0, oo) and distribution F(w, w') 
satisfying the linearity condition U6\) . Then 

E(W~W') 2 = 2Acr 2 , (17) 
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and when (W^ , W*) has the joint distribution 

dF\ W , W >) = j^ w ^ )2 dF{ W , W >) (18) 

and U ~ 1A[0,1] is independent of W\ , the variable 

W* = UW f + (1 - U)W i 

has the W -zero biased distribution. 

In particular, construction of zero bias couplings are typically possible when Stein pairs exist. We review 
the construction of (W^, W*) from (W, W) as outlined in [3]. Suppose we have a Stein pair (W, W) which 
is a function of some collection of random variables {S a ,a € x}; an d that for a possibly random index set 
I C X , independent of {S Q , a G x}, the difference W — W depends only on I and on {S Q , a G xi}, where 
Xi C x depends on I. That is, for some function b(i, 3 a , a G Xi) defined on i C x, and I a random index set, 

W-W = b(I,E a ,ae X i)- (19) 

We now show how, under this framework, the pair (W, W) can be constructed; the pair (W^VF*) will 
then be constructed in a similar fashion. First generate I, then independently generate {E a ,a G xi} an d 
finally {S Q , a G xi} conditioned on {S Q , a € xi}- That is, first generate the indices I on which the difference 
W — W' depends, then the underlying variables S Q ,a G xi which make up that difference, and lastly the 
remaining variables. This construction corresponds to the following factorization of the joint distribution of 
I and G x} as t ne product 

dF(U„,aex) =P(l = i)dFi{Z a ,ae X i)dF iali (Z a ,a<txi\Zc l ,aexi)- (20) 
For dF^ we consider the joint distribution of I and {S Q , a G x}, biased by the squared difference (w — w') 2 , 



is 



dFt(i,^,a G x) = x)- (21) 



From (fTT)) . (TT5]) and the independence of I and {S Q , a € x} we obtain 

P(l = i)M> 2 (i, 5 Q , a G Xi) = 2A ( r 2 . (22) 

iCx 

Hence we can define a probability distribution for an index set 1^ by 

P(lt=i) = _p_ where n = P(l = i)£6 2 (i, H Q , a G Xi)- (23) 
2Act z 

From dl9l), dH]) and ([23]), we obtain 

dFt(i,^ a ,a G x) - 6 (l '^' a a eXl) P(I = G Xi)^|i(^," ^ G Xi) 

r s 6 2 (i,£ Q ,a G Xi) , Wt ^ w „ , c - , t ^ ^ 

-d-ti(t, a ,a G Xi)«^i<=|i(?a,a £ Xi|?a,a € Xi) 



where 



2A<7 2 Sfe 2 (i,S Q ,aexi) 

= P(lt = i)^(( a , tt e x,)dFi.|i(£ a ,a £ Xi|^,« G Xi) (24) 



di? , a G Xi) = jTil: ^ ' a € *° , dJi , a G Xi) ■ (25) 
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Note that ([Ml) gives a representation of the distribution which is of the same form as (f^Uj) . The parallel 
forms of F and F 1 * allow us to generate variables having distribution F^ parallel to that for distribution F; 
first generate the random index I^, then {— a £ x\} according to dF^ , and lastly the remaining variables 
according to dFW^Q,, a £ xM 

That the last step is the same in the construction of pairs of variables having the F and F' distribution 
allows an opportunity for a coupling between (W,W') and {W\ W*) to be achieved; from Proposition 13. 1[ 
this suffices to couple W and W*. One coupling may be accomplished using the following outline, as was 
done in [3J. First generate I and {S Q ,a £ x}, yielding the pair W,W'. Next generate it and then the 
variables {—J, : a £ Xi+ } following c£F; if the realization of 1^ is i. Lastly, when constructing the remaining 
variables that is {_t : a £ Xi} which make up W*,W*, use as much of the previously generated variables 
{_ a , a ^ Xi} as possible so that the pairs (W, W) and {W\ W^) will be close. 

We review the construction of (W, W) in jB] for the case at hand, and then show how it agrees with the 
outline above. For distinct i,j £ {1, . . . ,n} let Tij be the permutation which transposes the elements i and 
j, that is, Tij(i) — j, Tij(j) — i and Tij(k) = k for all k (jL {i,j, }. Furthermore, given tt £ n„, let 

7T 

i,j ^) 7r (i)T?\' 7r (*) ' 

Note that for any given tt £ II„, the permutation irafj will again belong to IT n . In particular, whereas 
7r has the cycle(s) (i,n(i)) and (J, ir(j)), the permutation 7TQ!? - has cycle(s) and (7r(z), 7t(j)), and all 

other cycles in common with tt. Now, with 7r chosen uniformly from IT„ and W given by IjlOp . we construct 
an exchangeable pair (W, W), as in [BJ, as follows. Choose two distinct indices I = (I, J) uniformly from 
{1,2, ...,n}, that is, having distribution 

P(I = i) = P(/ = i,J = j) = 1 l(i f j) (26) 

n(n — 1) 



and let 



(27) 



Since (I, J) is chosen uniformly over all distinct pairs, tt and tt' are exchangeable, and hence, letting W' = 
5_ ^iir'(i)) so are (Wj W') - Moreover, as 7r' has the cycle(s) (J, J) and (tt(I),tt(J)), and shares all other cycles 
with tt, we have 

- W" = 2{d I<I) + d Jw(J) - (rf/j + d <IW) )). (28) 
Using p8]) it is shown in [6J that 

E(W-W'\W) = -W, (29) 
n 

that is, (fT6"|) is satisfied with A = 4/n. 

To put this construction in the framework above, so to be able to apply the decomposition (|24l) for 
the construction of a zero bias coupling, let x = {1> 2, ...,n} and _ Q = 7r(a). From ([25]l. we see that the 
difference W — W depends on a pair of randomly chosen indices, and their images. Hence, regarding these 
indices, let i = £ x 2 an d let I = (7, J) where I and J have joint distribution given in (|26|) . specifying 
P(I = i), the first term in ([20)) . Also, in this case Xi = i< Next, for given distinct i,j and tt £ n„, we have 
Tr(i) 7r(j), ir(i) ^ i and tt(J) ^ j. As tt is chosen uniformly from IT n , the distribution of the images k = £j 
and ^ = £j of distinct i and j under 7r is given by 

dF i (£ ol ,a£i)=dF itj (k,l)cxl(k^l,k^i,l^j) for i ^ j, (30) 

specifying the second term of (|2T))) . The last term in (j2"0")) is given by 

, P ^ ™_5l£ ^cil P(tt(z) = fc,7r(j) = /,7r(q) = g a ,a g {i,j,fc,*» 
dFielife^il^aei) = __________ , (31) 



G 



when i j and k ^ I. The equality in (I31|) follows from the fact that tt is an involution, implying 
£,k = 7r(fe) = 7r(7r(z)) = i and similarly for = j, and thus 

{7r(i) = fc,7r(i) = Z,vr(a) = (, a ,a {i, j}} 
= {7r(i) = k,n(j) =l,w(k) =i,7r(0 =j',7r(a) = £ Q , a £ {«, j, Z«,Z}} 
= {tt(i) = fc,7r(j) = Z,7r(a) = £ Q ,a £ {«, j, M}}- 

The conditional distribution in pip can be simplified further by considering the two cases k = j or equiva- 
lently Z = i and k ^ j ot equivalently k, l}\ = 4 separately. If fe = j then we have Z = i and thus we 

obtain 



p (n(i) = 3,n(j) = i) 
P(tt(i) = j,7r(a) = ^ {i,j}) 

P(7T(i)=i) 

= r S^ T = |n n _ 2 r 1 . (32) 

(n — lj 1 

For (j3"2"|) . we note P(ir(i) = j) = l/(n — 1) since 7r is chosen uniformly from II n . The last equality in P"2"j) 
simply indicates that once we fix w(i) = j, that is the cycle (i,j) in 7r we have to choose an involution 
uniformly at random from the rest of the indices that is Tl n -2 to obtain n in its entirety. This argument 
yields the recursion which we use in (|3"3")l later 

|n n | = (n-i)|n n _ 2 |. 

When we have k ^ j and hence Z ^ i or equivalently j, fc, Z}| = 4 in (|31[) . we obtain 

P(n(i) = k,n(j) = l,n(a) = £ a ,a <£ {i,j,k,l}) 



d^i<=|i(£a.a £ il^a," e i) 



P(n(i) = k,n(j) = l) 

n " 1 in^r 1 . (33) 



((„_!)(„_ 3))-i 



In (|33j) we have used the following equality which follows from the fact that tt is an involution chosen 
uniformly at random. 

p(7r(<) - fc, 7r(j) = = ^W*) = fc)^U(j) - «kw = *) 



(n-3)(n- 1)' 



From (|33[) we see that the conditional distribution dFici} is uniform over all values of £ a for a 6 x fo r which 
= k, £j = I and P(-rr(a) = a G \) > when \{i,j, k, l}\ = 4. Hence we may construct (W, W) following 
(|20| . that is, first choosing I = (/, J), then the images K = (K,L) of I and J under tt and 7r', then the 
remaining images uniformly over all possible values for which the resulting permutations lie in II n . 

We may construct (W', W*) from (J2U) quite easily now. In view of (fH?)) and (|28p. for pairs of distinct 
indices let 

b(i,j,7ti,7tj) = 2(d l7r , +dj Wj - (dij +d nnj )), (34) 

where again we may also let k = £j and I = £j. Now, considering the first two factors in (|24l) . using 
(HI, (HI) and J3DI) we obtain, 



P(lt = i)d J Fl t (C Q ,aei) = P(/t = i,jt=j) c Z^yKt = fc,Lt = Z) 

cx P(I = i,J = j)[d ik + d,-, - (<% + 4 ; )] 2 l(fc ^ /, fc ^ i, 1 + ]) 

oc [d lfe + dji - (dij + d k i)] 2 l(k ^ Z,Zc ^ i,l ^ j,i ^ j). (35) 
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By Lemma I'd. '31 below, relation (|3"5j) specifies a joint distribution, say p(i,j,k,l), on the pairs it = (jt, jt) 
and their images (£/t,£jt) = {K',L*) := Kt, say. Since when j — k, or equivalently i = I the square term 
vanishes, we may write 

p(i,j,k,l) = c n [d lk + dji - (dij +d k i)] 2 l(\{i,j,k,l}\ = 4), (36) 

where the constant of proportionality c„ is provided in Lemma 13.31 Next, note that since i,j,k,l have to 
be distinct in the definition (|3l)|) , the third term in dFic\i(j£ a , a ^ i|£ a ,a € i), reduces to uniform 

distribution over all values of £ a for a £ x f° r which ^ = fc, £j — I and P(7r(a) = £ Q ,a G x) > owing to 
(|33|) . Once we form (W\ W^) following the square bias distribution (TT8l) . its easy to produce W* which has 
the W zero bias distribution using Proposition 13.11 We summarize the conclusions above in the following 
lemma. 

Lemma 3.1. Let dF(w,w') be the joint distribution of a Stein pair (W, W) where 

n n 

W = ^2 di^i) and W' = ^ d l7T , {l) 

i=l i=l 

with 7r chosen uniformly from H n and tt' as in {2ty with I, J having distribution i26\) . Then a pair (W\ W*) 
with the square bias distribution U8\) can be constructed by setting 

n n 
i=l i=l 

where tt^ and n$ = ff^Cijl p are constructed by first sampling it = (jt, jt) and the respective images Kt = 
(ift.it) under according to h36\) and then selecting the remaining images of 7P uniformly from among 
the choices for which it lies in II„. Furthermore if U ~ £/[0, 1] is independent of (W\W^) , then W* — 
UW^ + (1 — U)W^ has the W zero bias distribution. Hence, if n and 7i"t are constructed on a common space, 
then so are W and W* . 

Given the permutations tt and tt' from which the pair {W, W) is constructed, we would like to form 
(W^W^) as close to (W, W) as possible, thus making W close to W*. Towards this end, we follow the 
construction noted after using many of the already chosen variables which form ir and tt' to make 

the two pairs close. In particular, begin the construction of {W\ W*) by choosing it = (it, jt) and Kt = 
(K\L r ) with joint distribution (JMJ), independent of tt and tt'. Let R x = |{7r(Jt), 7r(jt)} n {K\L^}\ and 
i? 2 = |{7r(/t), 7r (ift)}n{jt ! Lt}| ; dearly R u R 2 € {0,1,2}. Define tt+ by 





if 


Tr(jt) 


= ift and 


7r(jt)^it 


hence 


(Ri 


R 2 ) = 


(1,0) 




if 


Tr(Jt) 


^ if t and 


7T(jt) = it 


hence 


[Ri 


R 2 ) = 


(1,0) 




if 


TT(jt) 


= and 


7r(jt) ^ ift 


hence 


(Ri 


R 2 ) = 


(1,1) 


^"/t^tTrt, jtTK-t.it 


if 


Tr(Jt) 


^ it an d 


7t(jt) = ift 


hence 


(Ri 


R 2 ) = 


(1,1) 


7ra Kt,X,t 7 /t,it7jt,Kt 


if 


TT(lt) 


= Jt and 


7r(Vt) ^ it 


hence 


(Ri 


R 2 ) = 


(0,1) 


7ra /t,Jt T /t,LtT 7 t,ift 


if 


TT(lt) 


^ Jt and 


7t(/ft) = it 


hence 


{Ri 


R 2 ) = 


(0,1) 


7T 


if 


TT(jt) 


= K t and 


7T(jt) = it 


hence 


{Ri 


R 2 ) = 


(2,0) 


""Trt.itTjt.Kt 


if 


TT(jt) 


= Jt and 


7r(ift) = it 


hence 


(Ri 


R 2 ) = 


(0,2) 


""Tjt.jtr^t.it 


if 


Tr(Jt) 


= it and 


7t(jt) = #t 


hence 


(Ri 


R 2 ) = 


(2,2) 



(37) 



7raJ t ^-t a jt it when R\ — R2 — 0. 

The partition in display (|37[) is based on the possible values of (i?i,i?2); it does not include the cases 
where {Rx,Rz) = (2,1) or (i?i,i?2) = (1,2) because these two events are impossible. If Ri = 2 and 
7r(/t) = Kt )7r (jt) = it t hcn i? 2 = while if 7r(Jt) = it,7r(jt) = ift then R 2 = 2. Similarly one can rule 
out (JJxjiZa) = (1,2). Similar arguments show us that the cases described above are indeed exhaustive. 
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Clearly any two cases in (|57|) with differing values of (i?i,i?2) tuple are exclusive. Also, any two cases 
with the same tuple value, e.g. cases one and two, are also exclusive. For example, in case one we have 
7r(/ T ) = whereas in case two we have tt(J') 7^ making these two cases disjoint. In summary iv is 
well defined, and this construction specifies the pairs (tt,tt') and on the same space. The following 

lemma shows that the 7r T so obtained is an involution. 

Lemma 3.2. For tt G H n , the permutation 7r T defined in Jff7| j belongs to H n and has the cycles (J T ,if T ) and 
(J T ,L T ). Moreover, with as in Lemma \3.1[ the permutations tt : tt\it^ are involutions when restricted to 
the set X = {l\ J T , K\ L\ 7r(/ T ), 7r( J t ), 7r(if' ), 7r(L T )}, and agree on the complement X c . 

Proof. If we prove 7r T has the cycles (/ T ,i^ T ) and (J T ,L T ), then from (|57|) and n$ = ira^l Jt we have that 

tt, 7r T , ir^ all agree on the complement of I, which proves the last claim in the lemma. Note that it maps X 
onto itself and is an involution when restricted to X. Therefore tt is an involution when restricted to X c , and 
hence so are 7r T and Tr*. So, we only need to prove 7r T has the cycles as claimed. 

Since tt$ = ira^ t Jt , it suffices now to show that 7r T is an involution on X and has cycles (P,K*) and 

( J T , L T ), which can be achieved by examining the cases in ((37]) one by one. For instance, in case one, where 
7r(/t) = iff and 7r(jt) ^ £t we have X = {ft, jt, K\ L\ 7r(jt), ir{l3)} and 

7rt(/t) = 7ra^ t!it (7+)=7r(jt)=^t 

Trt(jt) = 7 ra} tit (jt)=7TT 7t!7r(L t ) (J t ) = 7r(7r(Lt)) =J Lt and 
7 rt( 7 r(J t )) = 7ra^ t , L ti"(>/ t ) = ^jt, 7r( Lt ) T it ,, r (,/t)7r(J t ) = 7r(L T ). 

Hence 7r T is an involution on I and has cycles (I T , A" T ), (J T ,L T ), (tt( J 1 *), 7r(£t)). 

In case ten, |I| = 8, and 7r T will be an involution on X with cycles (I T ,A"f), (Jt,Z/f), (tt(I^), tt(K^)), 
(tt( J'), tt(L')). As an illustration we note 

ir\l r ) = TraJt^tQijt.it^) = va^ iKi {lt) = 7TT Jt , w (irt)T fi :t, J r(.rt)(/ t ) = 7n- / t iTr(Kt )(/ t ) = tt^A 1 )) = K ] . 

That 7r T has the other cycles as claimed can be shown similarly. So in these two cases tv is an involution 
restricted to X and has the cycles (I',K<), (J',L*), 

That 7r T is an involution on X with cycles (I^,K^), (J^,L T ) can be similarly shown for the other cases, 
completing the proof. □ 

Henceforth we will only write a^j for afj unless otherwise mentioned. The utility of the construction 
(|37|) as a coupling is indicated by the following result. 

Theorem 3.1. Suppose tt is chosen uniformly at random from Tl n and (I\j\K^ 
, L') has joint distribution p(-) as in i36]) . If TV is obtained from tt according to J5"7| ) above, then tt and 
7r T are constructed on a common space and tp satisfies the conditions specified in Lemma 

Proof. By hypothesis the indices (P, J', A T , L T ) have distribution in (|36[) . From Lemma [3.21 we see that 
7r T is an involution and has cycles (I T , A T ), (J T ,L T ). It only remains to verify that the distribution of 7r T is 
uniform over all involutions in n„ having cycles (P,K') and (J T ,L T ). That is, recalling that F = (P, P) 
and Kt = {K\]J), letting 

n n ,it, K t - {tt e n„ : = k\^) = tf}, 

we need to verify that 

P( 7r t = #t } Kt) = —i ,=7.^ forall0GlI njIt , Kt . (38) 

bmce are distinct, with I as in Lemma 1531 the size of X satisfies 4 < \X\ < 8. In addition, 

since tt is an involution we see that |{/ T , J T , K\ L T } n {7r(/t), tt( P), tt(A t ), 7r(L T )}| is even. Hence so is \X\, 
and we conclude that X £ {4, 6, 8}. 
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For (j) £ n„ it Kt> independence of X and (lt,KT) yields 

P(7rt = 0|lt,K+) = ^(^ = ^1 l^l=^I + ,Kt)P(|J| = t |lt,K+) 

(.£{4,6,8} 

= £ P(^=<t>\ \X\=l,1\V^)P{\X\=l). (39) 

i£{4,6,8} 

For 7r £ II n let 7f denote the restriction of ir to the complement of P, J\ , . First consider the case 
l = 4 that is X = {P , J\ K', L^}. Since iv £ n nI t,Kt the permutation agrees with every <f> £ n n jt Kt 
on P ,P , K\ L\ and, as and it agree on Z c , 

P(V = <f>\ \X\ = 4,I t ,K t ) = P{^ = lj) \X\ =4,I t ,K t 

= P(w = lj>\ \X\ = 4,I t ,K t ) 
1 



|n„_ 4 | 

Now suppose l ~ 6. In this case, the set J = X \ {V , j\ K\ L^} has size 2, say J = {ii, ia}. We 
claim that 7i"T has the cycle («i,«2) with {11,12} = {7r(a), 7r(&)} for some a,b £ {P , j\ K\ L^}, and that 
conditional on \I\ = 6 and {P , J\ , P} the values ij.,12 are uniform over all pairs of distinct values in 
{V, , if 1 ', L t } c . That 7T has the cycle (11,12) follows from Lemma Suppose (i\, i 2 ) = (tt(J^), 7r(i^)) as 
in case 1 in (13"T)) . since ,P do not form a cycle, and their images under ir under case 1 is constrained exactly 
to lie outside {P , J\ K t, P}, as 7r is uniform over II„, these images are uniform over {/^, J^, iff, P}°. One 
can show that these properties hold similarly in the remaining cases. The remaining cycles of tp are in 
X c and thus are the same as those of ~k\t"- Thus, the cycles of 7rt are conditionally uniform, that is, for 

4> g n n Tt,Kt 



P(V =(/>| |X| = 6,1*,!^) =p(?rt =0 \X\ = 6,I t ,K t ) = — 

v / n 



The case 1 = 8, that is, where R\ = R2 — 0, is handled similar to t = 6. Here it 1 ' will have the cycles 
(7r(/t) ;7r (^t)) 7 (7r(jt) )7r (iit)). These two cycles are both of the form (77(a), tt(&)) with a,b £ {l\j\K\tf} 
and hence, as in the case t = 6, uniform random transpositions. Since, n and agree on X c , we can see 
that 7rT has uniform random transpositions on {P , J', K', P} c —X C U {Tr(P),Tr(J^),Tr(K'), ir(P)} yielding 

p(J = <f>\ \x\=%,i\tf)=p(rf = 4> m = 8,lt,K+) = —3 — 

\ / |H n -4| 

Thus (J3H1) now yields 

P(7rt = 0) = ' 



|n„_ 4 |' 

verifying (1381) and proving the theorem. □ 

So the tuple (tt^tt^) obtained in Theorem 13.11 satisfy the conditions in Lemma 13.11 Hence (W^W*) 
constructed from (71^, th) as in Lemma 13. II has the required square bias distribution. 

We conclude this section with the calculation of the normalization constant for the distribution p(-) in 

(El. 



Lemma 3.3. For n > 4 and D = ((dij)) nxn satisfying A15\) , we have 

c n Y \- dik + d i l _ + rffe; )] 2 = 1 where c„ = — — - 1 - = 0(^j), (40) 
\{i,j,k,l}\=4 [H ' [U ' 
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and in particular 



c n < — when n > 9. (41) 



/j 3 



Proof. From (|28|) . we have 

W -W = 2(d /ir(/) + d j7I (j) - (d u + d^( I)7l{J) )), 

where (/, J) are two distinct indices selected uniformly from {1, 2, . . . , n}. Since tt is an involution chosen 
uniformly, we have 

E (W - W'f = -r±— mdi^ + d Mj) (d l3 + d <l)A]) ) f 

4 Yl ( d ik + d ji - ( d v + dkl)f- (42) 



n(n — l) 2 (n — 3) 

1 71 ' \{i,j,k,l}\=i 



Using ((42]), ([17]) and (J29J) and a 2 D = 1, we obtain 



4 



n(n- l) 2 (n-3) lr 

^ ' K > \{i,j,k,l}\=4, 



^ (d ik + dji - (dij + d k i)f = 2\a 2 D = 



On simplification, we obtain 

1 



(dik + dji - (d i:j + d k i) f = 1, 



2(n-l) 2 (n-3) 

proving (|40| . The verification of ff4T| ) is direct. □ 

4 L 1 bounds 

In this section we derive the L 1 bounds for the normal approximation of W = X)T=i ^»'»r(<)) where rr is chosen 
uniformly at random from II„. The main theorem in this section is the following. 

Theorem 4.1. Let n be an involution chosen uniformly at random from IT„ and D = ((dij)) be an array 
satisfying U5\) . Then with ftp as in W — ^i-n-(i) satisfies 

\\F W - $||i < — (224 + 1344- + 384-^ ) when n > 9. 
n \ n n 2 J 

In particular, 

\\Fw-$\\i < 379— whenn>9. 
n 

We will need the following inequalities in order to prove Theorem 14. II To avoid writing down the indices 
over which we are summing, unless otherwise specified the summation will be taken over the same index set 
as the one immediately preceding it. 

With p(-) as in ((3l>() . in what follows, we will apply bounds such as 

^ \dik\p(i,j,k,l) = c n \dik\[d ik + dji - (djj + d M )] 2 

|{i,i,fc,*}|=4 

< ^ \dik\[dik + d 3 i - (d i:j + d k i)} 2 

i,j,k,l 

= cnJ2\ d ^ I (4 + 4 + 4 + 4 ) 

< 4c n n 2 f3 D < 4— when n > 9, from gj). (43) 

n 



11 



The first nontrivial equality above uses the special form of the term inside squares. Whenever we encounter a 
cross term we always get a free index to sum over which gives us zero since d,+ = Vi. The second inequality 
uses the fact that for any choices ti, L2, K\, K2 € {i,j,k, 1} with i\ 7^ K\ and 11 ^ k%, perhaps by relabelling 
the indices after the inequality, 

E \ d ^\< K2 < (Ei^-i 3 )*(Ei d «i 8 ) s ^ n2 ? D - 

Generally the exponent of n in such an inequality will be 2 less the number of indices over which we are 
summing up. For instance, if we are summing up over 5 indices the exponent of n will be 3 and so on. In 
particular, 

E \ d i.s\p(hj,k,l) < 4n 3 c n f3 D where 1 € {i,j,k,l} 

\{i,j,k,l,s}\=5 

< 4/3 D when n > 9, using (gTJ and (44) 
E \d s t\p{i,j,k,l) < Arfcnfin <Anfi D when n > 9. (45) 

\{i,j,k,l,s,t}\—6 

Theorem 4.2. Suppose D = ((dij)) is an array satisfying M5\) and ir and tt^ are as in Theorem lff.il and 
tt$, W\ and W* are as in Lemma \S.l\ Then W, W^W^ can be decomposed as 

W = S + T Wl = S + T* W i = S + T i , (46) 

where 

s = E rf wr(»)> T =/]d in (i), r f = E rf fa f M fl7lrf T * = E^W' ( 47 ) 

i^i iez iei iez 

where X is as in Lemma \3. 2\ Also, W* has the W zero bias distribution and satisfies 

E\W - W*\ < 112— + 672^ + 192^ when n > 9. (48) 

In view of (p~5]) Theorem 14.21 implies Theorem 14.11 

Proof. Lemma [3.11 guarantees that W* has the VF-zero biased distribution. Recalling I = {l\j\K\L\ 
Ti{I Ji ),TT{j' s ) 1 i:{K Js ) 1 i:{L' s )} and that 7r,7rt and n$ agree onI c by Lemma l3~2l we obtain decomposition (|46l) . 
From = + (1 _ u)W* and @6J|, we obtain 

E\W - W*\ = E\UT* + U)T X -T\. (49) 

Using the fact that E(U) = 1/2, and that U is independent of T' and T*, we obtain 

E\W-W*\ < -(E\T^\+E\T t \)+E\T\=EV where V = \T^\ + \T\, (50) 

where the equality follows from the fact that TT,ir*, and therefore T>,T$, are exchangeable. 

Thus our goal is to bound the L 1 norms of T and T^ and we proceed in a case by case basis, much along 
the lines of Section 6 in [3]. In summary, we group the ten cases in (|37|) into the following five cases: Rx = 1; 
i?i = 0, i?2 = 1; R\ = 2; R\ = 0, i?2 = 2; R\ = 0, i?2 — 0. 

Computation on R\ = 1: The event i?i = 1, which we indicate by li, can occur in four different ways, 
corresponding to the first four cases in the definition of ir> in (|37|) . With V as in (|50|) . we can decompose li 
to yield 

Vlx = Vl hl + Vl lt2 + Vl lt 3 + V1 1A , (51) 
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where l^j = l(n(I^) — K^ir^) ^ L^) and li )TO for m = 2,3,4 similarly corresponding to the other three 
cases in (l37|) . in their respective order. On l^i, we have X — {l\ j\ K\ L\ 7r(jt), ^(L^)} and 7r(7^) = K\ 
yielding 

riia = ^d i7r (j)li,i = 2(d /t j<rt + <ijt w (jt) + ^z,t7r(it))li,i- (52) 
iei 

By LemmaO^ has cycles (Z t ,ii' t ), (J*,tf) and is an involution restricted to X, hence 

T^li,i = d i7r t(j)li,i = 2(d/tift + djtLt + rf,r(jt) ff (Lt))li,i- (53) 

So, we obtain 

E|T|1 M < 2(£|ditA-t|li,i + -B|djt ff (jt)|li,i + -B|diMxt)|li J i) ( 54 ) 
B|Tt|l X)1 < 2(£?|d I t lJf t|l 1 ,i + S|djt,£t|lM+^K(Jt), T (it)|lM). (55) 
Because of the indicator in l]54p and (|55p. we need to consider the joint distribution 
p 2 (i,j, k, I, s, t) = P((jt, Jt, ift, it >7r (jt) j7r (.7t)) = (ijj> fe, /, S) t )), 

which includes the images of P , ,P under 7r, say s and t respectively. With c„ as in Lemma 13.31 we claim 
p 2 {-) is given by 



p 2 (i,j,k,l,s,t) = < 



^jp(i,j,k,l) when s = j,t = i 

when s = j,t ^ i or s =/= j,t = i 

when s = i ot t = j (56) 

when s = t 

( n _i)( n _3) P(2,.7, fc .O when t ^ {j,s,i} ^ s ^ 

To justify (1561) . note first that s = j if and only if t = i, for example, s = j implies t = = n(s) = 
7r(7r(i)) = i, and therefore 

{s=j} = {s = j,t = i} = {t = i}. (57) 

Thus the second case of ([56]) has zero probability. The remaining trivial cases can be discarded using the 
fact that 7r G n n . Leaving these out, the first probability is derived using (|57|) . since the image of V under 
7r is uniform over {I^} c and independent of {l\ J\ K\ L^), so in particular takes the value s = j with 
probability l/(n — 1). In the last case it is easy to see that t ^ {j, s, i} and s {i,t,j} are equivalent. The 
image of V is uniform over all available n — 1 choices, and, conditional on tv(P) ^ J', the n — 3 remaining 
choices for the image of fall in {I',tt(I'), jt} c uniformly. 

Next we bound each of the summands in (fM)) and ([53]) separately. First note that under l^i only the 
last form of p 2 (i,j,k,l,s,t) in (I56p is relevant. In particular, s ^ t always, and since i,j,k,l are distinct, 
s = k implies s {i,j}- Hence, for the first summand in (|54p . we obtain 

S|d 7 tift|li,i = ^ \d ik \p 2 (i,j,k,l,s,t)l(s = k,t^l) 

i,j,k,l,s,t 

= \ d ik\P2(i,j,k,l,k,t) 

\{i,j,k,l,t}\=5 

= 7 r ~ru~ t\ \ d ik\p{i,j,k,l) 

(n — l)(n — 3 

< —, -t/3d using gl 

n(n — 1) 

< 8^ when n > 9. (58) 
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Similarly, we can estimate the second summand in (|54[) as 

■S|djt^(jt)|li,i = X] l d it|P2(i,i,fc,/,s,i)l(s = k,t ^ I) 

i,j,k,l,s,t 

= ^2 \ d jt\P2(i,j,k,l,k,t) 

\{i,j,k,l,t}\=5 

= 7 TT7 y2\djt\p(h3,k,l) 

[n — \)(n — 3) 

4 

^ 7 TT7 ^Pd using flUj, when n > 9 

(n — l)(n — 3) 

< 8^ when n > 9. (59) 
The last summand in (|54|) is £J|(i i t 7r (i,t)|li,i- Using the fact that 

(Jt, J+.^.L+.TrC/t),^)) £ (^U^, J*M&)ML% 

c 

where = denotes equality in distribution, ()59|) yields, 



^|rf Lt7r (Lt)|li,i = ^Mjt w (jt)|li,i <8^ whenn>9. (60) 
Thus combining the bounds in (|58l) . (159)) , ([60)l we obtain the following bound on the term (|54[) . 

E\T\\ X i < 48^ when n > 9. (61) 
rr 

Now moving to (|55p . we note the first summand in (|55[) is the same as the first summand of (|54p . and 
so can be bounded by (|58|) . We can bound E\dji Lt|li,i, which is the second summand in (|55"|) in a similar 
fashion as in (|58|) through the following calculation 

£|<7tLt|li,i = \dji\p 2 (i,j,k,l,k,t) 

\{i,j,k,l,t}\=5 

= 7 ^ \ d ji\p{hJik,l) 

(n — l)(n — 3) ^— ' 

< 8^ when n > 9, using gl. (62) 
n 2 

So we are only left with the last summand of (|55|) which is £|d ff (jt) ir (it)|li,i- For this we will need to 
introduce the joint distribution 



P3 



k, I, s, t, r) = P((/t, Jt, K \ ) L \ 7r (jt) > 7r (jt) > 7r(£t)) = (i, j", fc, i, s , t, r)). (63) 



The case li.i is equivalent to s = k,t =/= l,r ^= j, and, since n € II „ it is equivalent to s = k and 
\{i,j,k,l,r,t}\ — 6. We claim that 

p 3 (i,j,k,l,s,t,r) = 1 —p(i,j,k,l) when s = fc and j, k, I, r, i}| = 6. (64) 

(n — l)(n — S)(n — 5) 

To justify (|64"|). we note that since tt is independent of {T>, J\ K\ L^}, the image of P is uniform over n— 1 
choices from {-F} c and conditional on 7r(/t) = ift ) the n— 3 choices for 7r(JT) fall in {V, J>, K^} c uniformly. 
Conditioned on these two images, when 7r(J^) ^ I/, n(L') is distributed uniformly over the n— 5 choices of 
{I\J\K\L\^)Y. 
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Now we can bound S|dn-(jtw(£t)|li,i in the following way, 

- B M 7r (./t) 7 r(Lt)|li,i = ^2 \dtr\P3(i,j,k,l,s,t,r)l(s = k,t^l,r^j) 

= ^2 \d tr \p 3 (i,j,k,l,k,t,r) 

\{i,j,k,l,t,r}\=6 

= 7 TV 1 oT? iTy^Mtrb&jjM) 

(n — lj(n — o)(n — 5) ' 

" (n-l)(»-3)(»-5) fa USing63 

< 16^ when n > 9. (65) 
So, adding the bounds in ([5"8" |) .(|6"2" ]l and (|55|) . and using (|55l) we obtain, 

S|T t |ln<64^ whenn>9. (66) 
n 2 

From (|6ip and (|6"6"|) . using the definition of V in ([5ll|) . we obtain the following bound on the first term of 

HEED, 

EVli i < 112^ when n > 9. (67) 
n 2 

Next, on 1 1)2 , indicating the event 7r(/t) ^ #t )7r (jt) = it ) we have I = {jt, jt, Jff ? £t ? ^(Jt), tt(K^)} 
and hence, by definition (|47l) . 



Til, 2 = 2(d Jt7r(/ t) + JffMift) + rfjt£t)ll,2 

We further observe, 

(I 1 , A J ftT t , L t , n(P), tt( J f ), tt^), 7r(Lt)) = ( J f , I 1 , L 1 ", if 1 ", tt( J f ), ^(i 1 "), vr^), (69) 
which because of (|6ip yields 

Tlx a =Tln =► £?|Tjli a = #|T|ln < 48^ for n > 9 (70) 

n 2 

Furthermore, the distributional equality in (1691) implies 

T*l lj2 = 2{d r \ K -\ + dj-\ L -\ + rf 7r (/t) 7r (Kt))li,2 = T t li,i, (71) 

yielding 

ElTtllia = J5|T+|ln < 64^ when n > 9. (72) 

Thus combining (|70l) . ([72l we obtain 

EVli 2 < 112^ when n > 9. (73) 

Next on l lj3 , indicating 7r(i f ) = L t ,7r(jt) ^ ift, we have X = {P, jt, if t, £t )7r (jt) )7r (jft)} an d 

Jli,3 = 2(d/t£t +djt 7r (,/t) +rfift 7r (i<rt))li,3, (74) 
T ,1 'li,3 = 2(djf K t + djf L f + dff(jt) ff (A:t))li,3- (75) 
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On 11,3, we have s = l,t k which is equivalent to s = I and k, l,t}\ — 5. Hence we may bound the 
first summand in (|74|) as follows 

E\dn L i\li, 3 = ^2 \ d a\P2(hj,k,l,s,t)l(s = l,t^k) 

i,j,k,l,s,t 

= ^2 \ d u\P2(i>j,k,l,l,t) 

\{i,j,k,l,t}\=5 

n - 4 



(n - l)(n - 3) 

< —, ttPd using O 

n(n — 1J 

8%- when n > 9. 



Continuing in this manner we arrive, as in (|73p . at 



Wli.s < 112% when n > 9. (76) 



/j 2 



Symmetries between lj. i and li j2 such as (j69|) hold as well between 1^3 and I1.4, yielding 

EVli 4 < 112% when n > 9. (77) 
Combining the bounds from (|67|) . (|73|) , (|76l) and (f77|) . we obtain 



EVU < 448% when n > 9. (78) 
Computation on R\ = 0, R2 = 1: Now we need to make the following decomposition 

Via = Vl 2 ,i + Vl 2 , 2 , 

where 1 2 = l(i?x = 0,R 2 = 1), l 2jl - l(7r(/t) = jt^jft) ^ Lt) ; i 2 2 = ^(/t) ^ jt^(^t) = L t). n 
l 2 ,i we have I = jt, K\ L\ tt(K^), 7r(£ f )} which gives 

Tl 2 ,i = 2(d JtJ t +d iRr t ff (Kt) +dit w (it))l2,i and 

T^li,l = 2(d/tjft + djtit + ^7r(ift)7r(it))l2,l- 

So, we obtain, 

£|T|l 2 ,i < 2(^1^^112,1+^1^(^)112,1+^1^^)112,1), (79) 

^|Tt|l 2>1 < 2(B|d J tjft|l2,l + S|djtLt|l2,X + B K(JCtMLt)|l 2) l)- (80) 

Since j, k, I) — p(i, k, j, I) and tt is chosen independently of {V , Jt, iC^, i/}, we have 

(/t, ift , jt j L t, 7r ( / t ) ^ (K t )j 7r ( jt)^ (jL t )) ^ (/ t ; jt, K \ L t, Jt), Tr(ift), vr^t)). (81) 

Hence we obtain 



which, by (|5Tj) gives 



Tl 2 i =Tlii, 



J5|T|l 2 i =J5|T|ln <48% whenn>9. (82) 
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To begin bounding _E|Tt|l2,i, we bound the first summand in S|d/tKt|l2,i) as follows 
£|<W|l2,i = £ \dik\P{{I ] ,K\j\L^-K{P)^{K^)) = {i,k,j,l,j,r)) 

\{i,3,k,l,r}\=5 

= J2\ d ik\P((I i ,J 1 >K\L\7r(I^),7r(J^)) = (i,k,j,l,j,r)) using 

= 7 q \ J2 \<Uk\p(i,j,k,l) 

(n — l)(n — 3) 

< 8% when n > 9, using gl. (83) 

Also using the distributional equality (I^,j\K\L\n(P),ir(K^)) = (J^,P,L\K\tt(J^),%(L^)) and the 
bound in ([55)1 above, we obtain 

£J|d J+i t|l 2 ,i =J5|d/tjft|l2,i < 8% whenn>9. (84) 

n 

Using the distributional equality in (IBTj) and the bound in (|65)l 

- E M7r(Kt) w (Lt)|l2,l = £'M 7r (jt) T (Lt)|ll,l < 16 ^J- ( 85 ) 

Combining dHU , dHS]) and using ([80]) we obtain 

£|T t |l 2 i<64% whenn>9. (86) 
Adding the two bounds in (|82|) and (|86|) we obtain 

£yi 2 ,i < 112 ^f when n - 9 - ( 87 ) 
Next, on 1 2 , 2 , where 7r(/ t ) ^ jt j7 r(A^) = L+, we have 1 = {P , jt , jft, L t , 7r(/t), tt( jt)} and hence 



Tl2, 2 = 2(d /t7r (/t) + djt w (jt) + d if tLt)l 2 .2 
^12,2 = 2(d /tKt +djtit +d w (/t) ff (jt))l2,2- (89) 

Noting the distributional equality 

(jt, J t ,A t ,i t ,^(/ t ), 7 r(J t ),7r(A t ),^(L t )) = (Jft,£t j /t j jt j7r (|ft) j7r (£t) )7r (/t) j7r (jt)) 

we obtain 

Tl 2 ,i = Tl 2 , 2 and T^i = Ttl 2 , 2 , 

which yields 

£|T|1 2 , 2 = E\T\l 2 ,i < 48% and E\T^\l 2i2 = B|Tt|l 2il < 64%. 

//- //- 



Hence we have 



#V1 2 2 < 112% when n > 9. (90) 
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Combining ([57]) with (|M|) we obtain 



EV1 2 < 224^ when n > 9. (91) 



?; 2 



Computation on i?i = 2: Here we need the decomposition 

Via = Vl 3 ,i + Vl 3 ,2, 

where 1 3 = l(i?i = 2), l 3 ,i = l(7r(/t) = Xt ;Vr (jt) = Lt) an d i 32 = l( 7 r(/ t ) = ^^(jt) = /ft). Note that 
13,1 and 13,2 correspond to the cases in (|57|) where (i?i,i?2) = (2,0) and (Ri,R 2 ) — (2,2), respectively. On 
both l3 t i and 1.3,2, we have I = {P, jt, K\ L'} and 

Th 3 ,i =Tl 3 ,i = 2(d JtK t +djt£t)l3,i since 7rl 3 ,i = 7rtl 3 ,i. (92) 

From (Jt, jt J .K't > .Lt j7r (jt) j7r (jt)) £ (jt j jt, Lt, Jft, 7r (jt) ; 7r (/t)) ; it i s dear that £?|djtjrt |l 3 ,i = ^Mjt L t |l 3 ,i 
and hence it is enough to bound any one of the two summands in (|92p . We bound E\ditKi |ls,i using (|4"3")l 
as follows 

S|d/tKt|l 3 ,i = Mifc|p2(«, j,k,l,s,t)l(s = k,t = I) 

i,j,k,l,s,t 

= ^2 \dik\P2(i,j,k,l,k,l) 

\{i,j,k,l}\=4 

< 8^ when n > 9. (93) 

Thus, using (|52")l and (|93p .we obtain 

£V1 3 1 < 64^ when n > 9. (94) 

On I3 2, we have 

n 3|2 = 2(d Jtit +(l jW ) and T+l3,2 = 2(d Jt Kt+djtLt). (95) 
To obtain bounds for £7(Vl3,2)j we bound the first summand in (|95j) as follows, 

E\dnLi\T-3,2 = ^2 \ d u\P2(i,j,k,l,l,k) 

\{ij,k,l}\=4 

= 7 TT? ^y>dKU,M)<8% when n > 9,using 63]). (96) 

(n — l){n — 6) *■ — ' nr 

Similarly its easy to obtain bounds on the other summands also and conclude as in (|94[) . 

£V1 3 ,2 < 64^ for n > 9. (97) 



/j 3 



Combining (l94l) and (|97|) . we obtain 



SVI3 < 128^ when n > 9. (98) 

n 3 

Computation for i?i = 0,i?2 = 2: This event is indicated by I4 = l(7r(/t) = jt^^t) = L'). On I4, we 
have 

Tl 4 = 2(rf Jt jt +d A 'tLt)l 4 , (99) 
T+l 4 = 2(d /tK t + d,/t L t)l4. (100) 



18 



For the first term in 

E\d ItJt \U = Yl \d ll \P((l\K\j\L\7r(P),ir(Kl) = (i,k 7 j,l,j,l)) 

\{i,j,k,l}\=4 

= ^2\dij\p2(i,k,j,l,j,l) using dH]) 

< 8^ when n > 9. (101) 

For the other summand in and (|100[) , we can follow similar calculations as in (jlOll) above and obtain 
the same bounds. Thus we will finally obtain 

£|Tl 4 |,.E|T t l4| < 32^ when n > 9. (102) 



So we have 



J5V1 4 < 64^| when n > 9. (103) 



Computation on R\ = R-2 = 0: Now we bound the L 1 contribution from the last case in (|37|) denoted by 
1 5 = l(Ri =i? 2 =0). Here we have Z = {/t, j\ K\ L\tt{I^),tt{J^),tt{K^),tt{L^)} and 

Tl 5 = 2 (d/t 7r (/t) + rf./t 7r (,/t) + d^t 7r (Kt) + rfLtTr(Lt)) (104) 

T T 1 5 = 2 (d/tift + d/t£t + 4(/t),r(ift) + rf w (Jt)7r(Lt)) • (105) 

Since we are on I5, we need to consider pz(-) as introduced in On I5, ps(-) is given by 

P3(i,j,k,l,s,t,r) = — 1 —p(i,j,k,l) when \{i,j,k,l,s,t,r}\ = 7. (106) 

(n — l)(n — 3)(n — 5) 

The justification for (|106l) is essentially the same as that for 
Using P3(-)i we begin to bound the summands in (I104p . 

£, M/t^(/t ) |l5 = 51 \dia\pa(i,j,k,l,s,t,r) 

\{i,j,k,l,s,t,r}\=7 



(n-6)(n-5) 



X! \ d is\p(i,3,k,l) 



(n-l)(n-3)(n-5) . 

I {2,JJ,fc,i,S^ 1=5 

< 8— when n > 9. (107) 

n 

It is easy to see that V , j\ K have identical marginal distributions and since 7r is chosen independently 
of these indices, we have E\d N7r ^ N - ) \l 5 is constant over {I>, J\K\ L^}. Thus we obtain 

£|T|1 5 <64— whenn>9. (108) 
n 

Now bounding the L 1 norm of the first summand in (jl05p . 

E\d^ Ki \l 5 = ^ \<Uk\Ps(i,j,k,l,s,t,r) 

\{i,j,k,l,s,t,r}\ = 7 

(n-4)(n-5)(n-6) 



^2 \<kk\p(i,3,k,l) 



(n-l)(n-3)(n-5) Ir 
< 4— when 7i > 9, using @3J. (109) 

71 



19 



Now consider the last summand of (|105[) , 

-EK(J1>(Lt)|l5 = E \dtr\P3(i,j,k,l,s,t,r) 

\{i,j,k,l,s,t : r}\—7 

s (»- D(" = S)(„- 5) |((J S r)M 

< 8— when n > 9, using g5). (110) 

Since (/t, jt, iff, it, ^(/t), ^(jt), 7r ( J ^t) j ^j&t)) £ (jt, l\ tf, K\ 7r(jt), 7r(/t), tt(X,+), »r(Jf f )), we see that 

S|dj tJrt |l 5 = £7|d Jti t|l5 and £K (/ t Mi ft) |1 5 = £K(ji> ( Lt)|l5- (HI) 
So, using (|TUgi) . (|TTUl) along with (fTTTj) . we obtain 

^|T t |l 5 <48— whenn>9. (112) 



Combining f|108[) and (|112[) . we obtain 

BV1 5 < 112— when n > 9. (113) 
n 

Combining the bounds from (|78")l . (|9Tj) . (j98 |) . (|T03| and ([Tl3]) . we obtain 

E\W-W*\ <EV< 112— + 672^ + 192^ when n > 9. (114) 



This completes the proof of Theorem 14.21 □ 

5 L°° bounds 

In this section we will use Theorem l4 . 2 1 obtained in the previous section to obtain L°° bounds using arguments 
similar to those in [I]. It is worth noting that we can use L 1 along with L°° bounds to obtain L p bounds for 
any p > 1 using . The main theorem of this section is the following 

Theorem 5.1. Suppose we have an n x n array D = ((dij)) satisfying dij — dji,du = di+ = V i,j and 
Ojj = 1. IfW = ^2 diTrii) where it is an involution chosen uniformly at random from H n , then for n > 9 

||JW-*||oo<-K'— (115) 
n 

Here K = 61, 702,446 is a universal constant. 

Theorem 15.11 readily implies Theorem 12.11 by ([3]) and Theorem 14. II 

We claim that to prove Theorem 15.11 it is enough to consider arrays with fip/n < e = 1/90, and 
n>no = 1000. To prove the claim, first note that 



pi = ( E i^-i 3 ) 1 > "- 1/3 (E fe-i 2 ) 1 = n ~ 1/3 ( {n xTi) 3 T ■ (116) 

l<i,j<n ^ ^ ' ' 



which is greater than 1/2 for n > 4. In ()116|) . the first inequality follows from Holder's inequality and the next 
equality follows from the fact that a 2 D = 1. So if n < no then ftu/n > l/(8no). Since K > max{l/eo, 8no} 
inequality (|1 15[) holds if po/n > eo or n < hq. 
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One useful inequality that will be used repeatedly in the proof is the following 

3 



(117) 



The proof of Theorem 15.11 proceeds by first proving several auxiliary lemmas. The first lemma helps 
bound the error created when truncating the array as in pQ, page 382. For the array D = ((dij)) nX n, we 
define 



Letting T — : \d tJ \ > |} and T t = {j : e T} we have 

|r<] < 8^ \dij\ 3 and hence 

3 

|r| < 8(3 D . 

Inequality (|120|) has the following useful consequence, that is, 



(118) 

(119) 
(120) 



\T\ ^2|r| < 16fe_ 
n — 1 rt — rt 



(121) 



Lemma 5.1. Suppose ((dij)) nxn satisfies the conditions in Theorem 15.11 Then with ((<^.-)) n xn defined in 



(122) 



Ull) ) /3_d < eo n awc ^ n > no, ™e /lave, 

\d' ++ \<4/3 D and \d' i+ \ < 4^ |<%| 3 < 4fe, 



and therefore 



IMC 



< 



l<rfy - II < 10 



n 

/3c 



and /?£>/ < 22/3d. 



Proof. Using d++ = and (|120[) . we have 
k/' 



E d «l = l E ^ E < |r|W < 

(ij) : |d 1J |<i/2 (ij)er (i,3')er 



Similarly, as d i+ — for all i £ {1, . . . , n}, we obtain using (|119l) . 

K + i = iE^i = iE^i^i r «i 2/3 (Ei^i 



1/3 



<4^|d ii | 3 <4/3 D . 



Now, using fjj and (fT2"3"j) , 



n - 1 



<i^<8^. 
n — 1 n 



(123) 



(124) 
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To prove the last assertion, first note that by (|124[) we have 

n n 

E k+i 2 ^ 4 ^E K+i ^ 16 ^EE i^i 3 = 16 ^- 

i— 1 i—l i j 

Similarly one can obtain 
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Ei^+i 3 ^ 64 / ? 



(125) 



(126) 



From ([5]) and the fact that = 1 we obtain the following, 



kry-1 



< 



< 



< 



2 


(n- l)(n- 


-3) 


2 




(n — l)(n — 


3) 


2 





(n- l)(n-3) 

o 



((n-2) E (4-4) + ^' + 2 + -2Ed 



(n-2) E 4 + ^? + +2E< 



/2 



(«,j')er 



(n-2) E 4 + 16 7 + 32/% I using O and (127) 



(ij)er 



32 



0% 



64- 



(n-1) 



n — 1 (n — l) 2 (n — 3) (n — l)(n — 3) 

< 10— < - since D /n < 1/90 and n > 1000. 
n 9 



(128) 



Hence, we obtain |cr|j, — 1| < 1/9. Thus using © with notation as in (fTTj). and inequality (| 1 1 7[) . we obtain 



(T 3 ^ 

* i6 ^e(^. 



d' 



13 n-2 7i-2 (n-l)(n-2) 
\d> 



' 13 



3 , Kh 

" (n-2) 3 ^ (n-2) 3 " r ((n - l)(n - 2)) 3 



16— ( - !2s- ^ 



(n-2) 3 



64n^ 



((n-l)(n-2)) 3 



< 22^ D since - 1| < 1/9, /3i3/n < 1/90, 
where in the second inequality we use (|123[) and (|126[) . 



□ 



Lemma 5.2. Let D = ((dy)) oe an array as in Theorem \5.1\ and D' = ((d^)) &e as m \118\) . Let us denote 



djj — 



If Pd/h- < e o a?J d n > n , we /lave \dij\ < 1. 

Proof. Because a 2 D , > 1/2 from Lemma T5.ll on 1 > 2/3. Using this inequality, definition (j6]) and (j 122[) we 



22 



obtain 



\<kj\ < — (K 



< 



OW 

3 



1 



\d> 



'" n-2 v 
8 (3 D 4 



*+l + !**!) + (n -i) (n - 2) 



4 ctd> rt - 2 ct/j/ (n- l)(n- 2) 

< - + 16— + 4— since n > 1000 
4 n n 

= \ + 20— < 1 since (3 D /n < 1/90. 

4 71 

This completes the proof. 

For E — ((ey))„XTi let -Fe be the distribution function of Ye and 

6e = \\F E -QWoo. 
For 7 > let M n (-f) be the set ofnxn matrices E = ((ey)) satisfying 



Let us define 



Also, we define 



a% = 1, ey = e,-j,e<i = e i+ = Vi,j and /3e < 7. 



5(7, n) = sup 6e- 
seM„( 7 ) 



(5 1 (7, n) = sup S E where -^(7) = {£ e M n (^) : sup |ey| < 1}. 

ESMi( 7 ) i,j 



Lemma 5.3. When n > tiq, with 5d and 5 1 (7,n) defined in U29)) and H31}) . for all 7 > 

sup^z, : D G M n {i),p D /n < e } < 5 1 (22 7 ,n) + 36^. 
Proo/. Let L> = ((dy)) G M„( 7 ) with fe/n < 1/90. Hence, by (TT2Tj) and Lemmas ET21 and ISTTl 

5 D = \\F D -noo < sup\P(Y D , <t)-Q{t)\+P(Y D +Y DI ) 

t 

< sup\P(Y D , <t)-$(t)| + ^ 



< ^(22/3^)+ sup |<&(^^)-<i>(<)| + ^. 



CD' 



Since i5 1 (7,n) is monotone in 7, 



6 1 (22/3 D ,n) <d 1 (22-y,n) when /3d < 7; 



□ 



(129) 



(130) 



(131) 



(132) 



similarly, we bound the last term as 16/3d/^ < I67/71. So, we are only left with verifying that the second 
term is bounded by 20j/n. 

From Lemma I5TT1 we obtain \a D , — 1| < 1/9 yielding in particular aw G [2/3, 4/3]. First consider the case 
where \t\ > 8/3d/?i, and for a given t let ii = (t — /j>w)/&d'- Since \fJ-w\ < 8/3o/n by Lemma l5.11 i and ii will 
be on the same side of the origin. Next, it is easy to show that for a > we have \t exp(— at 2 /2)\ < 1/yfa. 
Hence 



t exp 



9 

32 



(t-fiw) : 



< 



(t - fiw ) exp 



32 



4 

< - 
- 3 



(133) 
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Since aw > 2/3 and Lemma fSTTI gives \a 2 D , — 1| < lQPu/n, we find that 



(Tfl/ +1 n 

Now, by the mean value theorem , aw & [2/3,4/3], and Lemma |5~T1 



|$(ii)-$(t)| < max 
1 



< 



< 



< 



< 



< 



, t - jiw 
aw 



t - (iw 



- 1 



aw 



where 6 = 



27T 

3 



max<j exp(- — (i-^i D /) 2 ),exp(- — ; 



2V2tt 
3 



0"/)' — 1| max 



^exp(- — (t-MD') 2 ) 



C7D/ 



*exp(- — ) 



. )7r ,^D' - + IMD'D) + using 



2tt 



|o\d- - 1 1 (1 + \nw 



2 i ^ ( i + 8 ^) + 6 
V2tt n n 



4 



I to' 



< 17— since fe/n < 1/90. 



/2vr 



to' 



/2vr 



to' 



(134) 



When |t| < Sfiu/n, the bound is easier. Since ii lies in the interval with boundary points 3(i — /irr)/2 
and 3(i — /Z£>/)/4, we have 



(135) 



Now, using |i| < SPn/n, \[iD'\ < 8/3_o/n and (|135|) . we obtain 



i 



< 



2tt 
1 



U -t 



(3|t|+2|^|) 



Thus we obtain 



completing the proof. 



1 to Pd 
< ^=40— < 20—. 

V27T n n 



sup |$(L_^1) _ $( f )| < 20^ < 20^, 
t aw n n 



□ 



In view of Lemma 15. 3( to prove Theorem 15.11 it remains only to show 5 1 (7, n) < cj/n for an explicit c 
which we eventually determine. Hence in the following calculations we consider only arrays D = ((dy)) nX n 
in M*(7), and so sup 4 • \dij\ < 1. 

We will need the following technical lemma. 

Lemma 5.4. If D E M^{pf) and B = ((6-y)) n _; X n-J * s ^ e array formed by removing from D the I rows and 
I columns indexed by T C {1, 2, . . . , n}, where I is even and I = \T\ < 8, i/ien /or n > no and /3d /n < eo/50, 
\hb\ < 8.07, \a% - 1| < .52, and /3 B < 50/3^. 
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Proof. To prove the first claim, we note since di+ = 0, \dij\ < 1 and I < 8, letting n = n — I, 

= I E dij | = | - E °M ^ 8 and hence = | E ^+1 - 8n. 

j£T jeT i$T 

This inequality implies \hb\ — \b++\/(n — 1) < 8.07 proving the first claim. 
Using a 2 D — \ and ([5]) we obtain 

n-2 ^ , Q (n-2)(n-l)(n-3) 



V d 2 - 

'(n-l)(n-3) ^ y 

l<i,j<n 



(n - 2)(n- l)(n - 3) 
Using the above bound, §5§ and (|136p . we obtain 
n-2 



G [1,1.01] when n > 1000. 



V d 2 

(n-l)(n-3) ^ « 

l<i,j<n 



(136) 



(137) 



(138) 



< 



< 



< 



(n-l)(n-3) 
2 

(7T- l)(n-3) 
32ti 



(--2)1 E 4-E rf 

{igT}n{jgT} 



(n-2) E 

{ieTjuyeT} 
128tT 2 256n 



,2 64n 2 
4 + 

J 71—1 



n- 1 



-2$>! 

i4T 



128ti 



272(n-3) (ti- l) 2 (?i - 3) (n-l)(n-3) 
< .51 when ti > 1000, 

where for (I139[) , we use |T| < 8, /3d < 7i/(50 x 90) and Holder's inequality to obtain 



E 



n 



i 2 - 



i 



(139) 
(140) 



< 2 x 8^713 since £™ =1 4 < (E"=i My '| 3 )*n* < Pb n " and |T| < 8 

71 

< 16— since (3 D < ti/4500. 

From (|140[) and (|137[) . we obtain |cr^ — 1| < .52 for n > 1000, proving the second claim of the lemma. 
To prove the final claim we observe = | E^/eT^yl" Thus, by (|117[) and |T| < 8, 

E^r l^+l 3 = E^r I Eierdtf I 3 ^ 64 E^r E jeT < 64 ^ and similarly 
M++I 3 = IE^tM 3 <« 2 E^rl^+l 3 <64n 2 fe. 
These observations, along with the fact that a B > .48, yield 



(141) 



E{igT}n{j£T} IM 3 



3.1 e 

{^T}n{^T} 



<3.i e fei 

{^r}n{j^T} 
h+ b +j 



(n-2) (n-2) (n-l)(n-2) 



< 3.1 x4 2 El 6 yl 3 + 2 "E 



(n - 2) s 



71 2 



++I 



< 49.6 /3 



128- 



r /3 D +64/3 D — 



((n-l)(n-2))3 



using (|117p 



—4 



'(n-2) 3 ^ ^"((7T-1)(tT-2)) 3 
< 50p D when ti > 1000, since I < 8. 



using (|141l) 
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This completes the proof. □ 
Lemma 5.5. Consider a nonnegative sequence {a n , n £ Z} such that a n = for all n < and 

a n < c + a max a n -i for all n > 1, (142) 

(£{4,6,8} 

where c > and a G (0, 1/3). Then, with b — max{c, ai(l — 3a)}, /or all n > 1 

b 



a n < 



1 - 3a 

Proof. Letting b n ,n> 1 be given by the recursion 

6 n+ i = 3afe„ + b for n > 1, with b\ = ax, 
explicitly solving yields for n > 1 

b n = ci(3a)™ + c 2 where c\ = a ^(\-za) and ° 2 = 

Since c\ < and 3a < 1, 6„ is increasing and bounded above by Ci. Hence it suffices to show a n <b n . 
Since a n is nonnegative, f)142f) implies that 

a„ < c + a a-n-1 for n > 1- (143) 

Ze{4,6,8} 

We show a m < b m for all 1 < m < n by induction. When n < 4, since a n _; = for ^ £ {4, 6, 8} we have 

a n < c < b < b n . 

Now supposing the claim is true for some n > 4, by (|f 431) and the monotonicity of b n ,n > 1, we obtain 
a n +l < c + a ^ On+l-J < + a - b + 3afe 

rt—3 ~t~ 3a6 n — b n -\.\. 

/e{4,6,8} ;e{4,6,8} 
This completes the proof. □ 
Lemma 5.6. With J 1 (7,71) defined as in US1\) . 

(J 1 (7, n) < 2804655- when n>n = 1000. 
n 

Proof. We will consider a smoothed family of indicator functions indexed by A > 0, namely 

( 1 if x < z 

h z ,\(x) = < l + (z-x)/\ ifx£(z,z + X] (144) 
[ if x > z + A. 

Also, define 

h z ,o(x) = l(-oo, z ](a;). 
Let f z ,x denote the solution to the following Stein equation 

f'(x) - xf(x) = h^ x - $(h z , X ) where $(h z , x ) = E(h z .x{Z)) and Z ~ JV(0, 1). (145) 
We will need the following key inequality about f z x from pQ, 



\f'z,x(x +V)~ f'z,x( x )\ ^ \V\ (i + M + \J q l lz,z+X]{x + ry)d^j for any A > 0. 



(146) 
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We consider D £ M^ij) with fto/n < eo/51 and n > 1000. Let W = Yn=i ^mH) as before. Using the fact 
that 

h z .o < hz,\ < h z+ \.o, 
and recalling the definition of §n in (11291) . we obtain 

So < sup \E(h g , x (W)) ~ *(h z ,x)\ + A/V2^. (147) 

Z 

From ([155]) . (JUJ and Var(W) = 1, we obtain 

sup\E(h z , x (W) - $(h z , x )\ = sup\E(f' z> x(W))-E(Wf Zt x(W))\ 

Z Z 

= S up\E(f' Zi x(W)) - E(f> tX (W*))\ 

Z 

< S upE\f' z ,x(W*)-f' z ,x(W)\. (148) 

z 

From (|147|) . f|148|) and (|146j) . we obtain the following 

Sd < sup E\f' z<x (W*)- f z ^(W)\+X/V2^ 

Z 

< supsjV*- W\ (l + 2\W\ + jJ^ l [z ^+x]{W + r{W* -W))drX^ + X/V2^ 
= E\W* -W\ + 2E(\W\\W* ~W\) 

+ supjE (\W* - W\ l [z , z +x]{W + r(W* -W))dr^j + A/V^r 
:= A 1 +A 2 +A 3 + A/V2^ say. (149) 
First, Ax is bounded using Theorem 14.21 as follows 

Ax = E\W* - W\ < 112^ + 672^ + 192^ 

< 113— since n > n = 1000. (150) 

71 

Next we bound ^2- From Theorem 14. 21 we obtain, 

TU*-VK = {UW^ + {l-U)W t )-W 

= (U{S + T^) + {l-U){S + T t ))-(S + T) = UT> +{l-U)T t -T. (151) 

Let I = (/t, Jt ) jft ) it j7r (7t) j7r (jt) j7r (jft) )7r (it)) an d I be as defined in Lemma [3J2] Thus, by (gTJ), the 
right hand side of (|151j) . and hence TU* — VF, is measurable with respect to J' = {I, J7}. Furthermore, since 
supj j < 1 and |I| < 8, we have 

\W\ = \S + T\< \S\ + \T\ = \S\ + \J2 \<\S\ + J2 1**0)1 ^ l 5 l + 8 - 

Now, by the definition of A 2 , and that U is independent of S and I, 

A 2 = 2E{\W\\W* - W\) 

= 2E(\W* - W\E{\W\\,f)) < 2E (\W* - W\E(\S\ + 8\S)) 



< 2E[\W* -W\x/E(S 2 \I)) +16E\W* -W\. (152) 



In the following, for i a realization of I, let |i| denote the number of distinct elements of i. Since 5 = 
Y)j£r. ^iTr(i) an d 7r is chosen from II n uniformly at random, conditioned on I = i, S has the same distribution 
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as Yli0^iO{i)> where B = ((fry)) n -|i|x7i-|i| is the matrix formed by removing rows and columns of D that 
occur in i and 9 is chosen uniformly from LT n _|ii. Since |i| < 8 and fio/n < cq/51 < eo/50, Lemma [5^41 yields 
\hb\ < 8.07 and a 2 B < 1.52, and hence 

E\Y B \ 2 < 1.52 + 8.07 2 = 66.6449 when n > 1000. (153) 

Using (|153[) . we obtain 

E(S 2 \I = i) = E\Y B \ 2 < 67 for alii. 
Thus using (|152[) and (I150[) . we obtain 

A 2 < 33E\W* - W\ < 3729— . (154) 

n 

Finally, we are left with bounding ^3. First we note that 

W + r{W*-W) = rW* + (l-r)W 

= r(S + UT> + (1 - U)T X ) + (1 - r)(S + T) 

= S + rUT^ + r(l - U)T X + (1 - r)T 

= S + g r where g r = rUT^ + r(l - J7)Tt + (1 - r)T. 

Now, from the definition of ^3, again using that W — W* is J' measurable, 

A 3 = auj£>±E(\W-W\J 1 {z , z+ x](W + r(W* -W))dS) 

= sup ~e(\W-W*\E{ J l [z , z+ x](W + r(W* -W))dr\JP)\ 
= sup jE (aw ~ w *\ J q P ( W + r ( w * ~W) e[z,z + \]\j?)dr^j 

1 r 1 

= sup -E(\W-W*\ P(S + g r e [z,z + \]\S)dr) 

* A Jo 
1 f 1 

= sup-E(\W-W*\ P(S e[z-g r ,z + \-g r }\j?)dr) 

z A JO 

1 r 1 

< / supP(Se [z-g r ,z + \-g r ]\S)dr) 

A Jo z 

1 r 1 

= -E(\W-W*\ sup P(Se [z,z + \}\J)dr) (155) 
A J z 

= ^-E(\W ~W*\supP(S £ [z,z + \]\S)) 
A z 

= \e{\W -W*\supP(S £ [z,z + X]\l)), (156) 

where to obtain equality in (|155[) we have used the fact that g r is measurable with respect to J? for all r. 
It remains only to bound P(S £ [z,z + A]|I). In the following calculations = bij/a B as before. Since 
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Po/n < eo/51, Lemma yields as > 1/2. Hence, 

supP(S e [z,z + A]|I = i)) = su P P(^6 ie(4) e [z,z + A]) 

= sup PQ^— — e [—,-—]) 



< 



supP(£^G[A- + 2A]) 

If 1 

= sup P(Y ^ e [z,z + 2A]) 

z z — ' <JB 

Ifl 

= 8U P P(V& ie( i) € [2,Z + 2A]) 

Z A- 

= sup P(Y S e [z,z + 2X]). (157) 

z 

The equality (| 15T[) holds as when computing by we have that an d = J2j0 b+j do not 

depend on 9. Recalling that the distribution function of Yg is denoted by Pg, we have, from the definition 
of 5(7, n) in COP, 

P(Yg e [z,z + 2A]) < |Fg(z + 2A)-$(z + 2A)| + |Pg(z)-$(z)| + |$(z + 2A)-$(z)| 

< 2 {g + » ,158) 



Note that /3g = /3b and by Lemma \5A\ 

[77 = [77 < 50 pry < 51 < £ ■ 

n — 1 1 1 7T, — |i| n — |i| n 
Hence using the fact that B g M n _^(f3B) and applying Lemma f5 .31 we obtain for n > 1008 

S s < ( 5 1 (22/3g)+36^- =5 1 (22^B,n-|i|) + 36^-. (159) 
n — |i| n — l 

Inequalities (| 158[) and (|159[) imply 

2A 

P(Yge[z,z + 2A]) < 2S S + — 

V Z7T 



< 2<5 1 (22/3 B ,n- |i|) + 72 ' ' i; " A 



< 2^(22 x 50#D,n- |i| 



n-\i\ ^/2^ 
72 x 50(3 D 2A 



n-|i| 



< 2 max 5 1 (fc 1| S I3 ,n-Z) + ^^ + -^ ! (160) 
ie{4,6 : 8} n V27T 



where fci — 1100 and fc 2 = 3630. As (|160[) does not depend on z or i, it bounds sup z P(S € [z, z + A]|I)) in 
(HSU). 

Now using (fT56)) . (fT60|) and (fT50")) . we obtain 

A 3 < i(2 max i 1 ^, n - I) + fe — + -^L)E\W - W*\ 
A ;e{4,6 : 8} n v27r 



< 1(2 max 5 1 {k 1 (3 D ,n-l) + k 2 ^- + 4^)113— . (161) 

A ie{4,6 : 8} n y/2n n 
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Combining ([150)1 . (fl"53)l . (|W)) and using ([Hi?)) we obtain for n > 1008, 

< 3842^ + 1(2 max S 1 (k 1 j3 D , n - I) + fc 2 ^ + -?L)113 A ' ' 



n A ;g{4,6,8} n ^/2n n ^/2tt 



< 3842^ + 1^(2 max 5\ki0 D , n - I) + k 2 - + -^=) 



A 



n nA ie{4,6 : 8} n v 7 ^- V^r 

since fi D <1 for all D e -^(7). Setting A = (113 x 8)^,7/71, we obtain for n > 1008, 

c ^ 7 . 1 S 1 (k 1 (3 D ,n- 1) 

o D < c — I — - max , 162) 

n 4 ;e{4,6,8} k x 

where c = 400,665. Since c > 51/eo, its clear that if we consider 13 € M*(7), with ftn/n > e o/51, then 
S D < cr)/n. Hence (I162p holds for all D e M^(7) with n > 1008. Taking supremum over D G M*(-y), we 
have, for n > 1008, 

x i, n ^ 7 . 1 ^fciT^n-Q 

(7,n) < c — h - max , 

n 4 ;g{4,6,8} fci 

where c = 400, 665. Now multiplying by n/7 and taking supremum over 7 we obtain 

^(W n") 2 t^h- n — I) 
supn u ' ; < c + - max sup(n - I)— ^ i for all n > 1008. (163) 

7 7 7 /e{4,6,8} 7 7 

If D e (7) and n > 1000 then (|TTB|) shows that fi D > 2, and hence D g M^(j) for all 7 < 2. Since 
<5 1 (7, n) < 1 for all ri e N, for 1000 < n < 1008 we have 

supn w ' 7 = supn w ' ; < 1008 sup w ' ; < 504. 

7 7 7>2 7 7>2 7 

Since c > 504, we conclude (|163p holds for n > 1000, that is 

dV-r nl 2 8 1 {^ n — I) 
supn w ' ; < c + - max sup(n - I)— — - for all n > 1000. (164) 

7 7 7 /e{4,6,8} 7 7 

Letting s„ = sup 7 n<5 1 (7, n)/7 and a„ = s„ + gg9 for n > 1, and a n = for n < 0, (|164[) gives 

2 

a n < c H — max a n _2 for n > 1. 

7 /G{4,6,8} 

Using Lemma [5.51 with a = 2/7, c = 400,665 and noting that ai — nS 1 ^, 1000)/7 < 500 since 7 > 2, we 
obtain 

ai(l — 3a) < 500(1 — 6/7) < c which yields b = max{c, ai(l — 3a)} = c, 

and therefore 

a n < —^r = t-V = 2804655. 
1 — 3a 1 — da 

This completes the proof of the lemma. □ 
Lemmas 15.61 15.31 and remarks following (|116|) yield 

\\F W - $IU < K— with n > 1000, 

n 

where we can take K = 22 x 2804655 + 36 = 61, 702, 446. This proves Theorem 15.11 and hence completes the 
proof of Theorem [271] as well. 
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